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Abstract — In wireless networks, the transmission rate of a 
link is determined by received signal strength, interference from 
simultaneous transmissions, and available coding-modulation 
schemes. Rate allocation is a key problem in wireless network 
design, but a very challenging problem because: (i) wireless 
interference is global, i.e., a transmission interferes all other 
simultaneous transmissions, and (ii) the rate-power relation is 
non-convex and non-continuous, where the discontinuity is due 
to limited number of coding-modulation choices in practical 
systems. In this paper, we propose a distributed power control 
and coding-modulation adaptation algorithm using annealed 
Gibbs sampling, which achieves throughput optimality in an 
arbitrary network topology. We consider a realistic Signal-to- 
Interference-and-Noise-Ratio (SINR) based interference model, 
and assume continuous power space and finite rate options 
(coding-modulation choices). Our algorithm first decomposes 
network-wide interference to local interference by properly 
choosing a "neighborhood" for each transmitter and bounding 
the interference from non-neighbor nodes. The power update 
policy is then carefully designed to emulate a Gibbs sampler 
over a Markov chain with a continuous state space. We further 
exploit the technique of simulated annealing to speed up the 
convergence of the algorithm to the optimal power and coding- 
modulation configuration. Finally, simulation results demonstrate 
the superior performance of the proposed algorithm. 

I. Introduction 

Wireless communications have become one of the main 
means of communications over the last two decades, in the 
form of both cellular (WWAN) and home/business access point 
(WLAN) communications [1|. Recently, with the development 
of data centric mobile devices, e.g., iPhone, we have seen a 
renewed interest in enabling more flexible wireless networks, 
e.g., ad hoc networks and peer to peer networks fl3l . A 
key problem in the design of ad hoc wireless networks is 
link-rate control, i.e., controlling transmission rates of the 
links. In wireless networks, the transmission rate of a link 
is determined by received signal strength, interference from 
simultaneous transmissions, and available coding-modulation 
schemes. Because wireless interference is global, and the 
rate-power relation is non-convex J2[ and non-continuous, 
distributed link-rate control in ad hoc wireless networks is 
a very challenging problem. 



One approach to tackle the link-rate control problem in the 
literature is to assume that the coding-modulation scheme is 
predetermined, i.e., all links use the same coding-modulation 
scheme. In this case, there is an SINR threshold associated 
with each link, and a transmission over the link can be 
successfully decoded when the actual SINR is above the 
threshold. This assumption is reasonable for voice-centric 
wireless networks where the same voice codec is used at 
all devices. Under this assumption, the link-rate control is 
translated into a power control problem where the objective is 
to find a set of minimum transmission powers such that the 
SINRs at all links are above the thresholds, if possible at all. 
This problem has been well studied in the context of the power 
control in cellular communications lfl4l and simple iterative 
algorithms can be shown to converge to the optimal feasible 
power allocations. 

Another approach in the literature is to assume the link rate 
is a continuous function of the SINR of the link 0, ifTTl . 
For example, a model that has been extensively adopted is 
to assume r a b = |log 2 (l + SINR(ab)), where r a b is the 
transmission rate of link (ab). In other words, it assumes that 
for each SINR level, the capacity achieving coding-modulation 
is available. Under this assumption, the rate control problem 
again is formulated as a power control problem where the 
objective is to find a set of powers to maximize system 
utility defined upon achievable rates t/ a b( r o&), where U a t(-) 
is the utility function associated with link ab. This problem 
is also well understood in cellular networks given the recent 
advances in optimal power control and rate assignment[4|, 0, 
where distributed iterative algorithms are shown to converge 
to the utility maximizing power allocations, after introducing a 
small signaling overhead to the cellular air interface. However, 
these approaches ignore the non-convex nature of the problem 
and the algorithms proposed here converge to the utility 
maximizing operating point on Pareto boundary of the rate 
region, assuming all devices have to transmit all the time. In 
the context of ad hoc networks, such approaches can be highly 
sub-optimal since the time-sharing, or inter-link scheduling, 
nature of the problem has to be considered due to highly non- 
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convex nature of the rate-power function. Towards this end, 
queue-length distributed scheduling is shown to be through- 
put optimal ID, Q, HO), 0, through the use of MCMC 
(Markov Chain Monte Carlo) models. These results however 
assume collision-based interference model, which in general 
is over-conservative, and assume fixed transmit power and 
coding-modulation scheme. Both transmit power and coding- 
modulation can be adaptively chosen in practical systems. For 
example, in 802. llg, eight rate options are available. 

In this paper, we extend the framework in 0, Q, iflOl . ll9ll 
to develop a distributed joint power control and rate scheduling 
algorithm for wireless networks based on the SINR-based 
interference model. We assume that each node has a finite 
number of coding-modulation choices, but can continuously 
control transmit power. We propose a distributed algorithm 
that maximizes the sum of weighted link rates 53( a 6) w ab r ab, 
where r ab is the rate of link ab. The main results of this paper 
are summarized below: 

• We consider realistic SINR-based interference model, 
where a transmission interferes with all other simultane- 
ous transmissions in the network. Our algorithm decom- 
poses network-wide interference to local interference by 
properly choosing a "neighborhood" for each node, and 
bounding the interference from non-neighbor nodes. 

• We assume continuous power space and finite coding- 
modulation choices (rate options). The objective of the 
algorithm is to find a power and coding-modulation 
configuration that maximizes the sum of weighted link- 
rates 

max P,m Y,(ab)e£lab(t)r ab {m ab ,p) 
subject to Eb-.{ab)e£ Pob < PT X , Va G V, 

where q ab (t) is the queue length of link (ab) at time 
slot i[] p is a vector containing the power levels of all 
the links in the network, p max is the maximum power 
constraint, and m a b is the coding-modulation scheme. 
Due to the nonconvexity and discontinuity of r Q b(-), 
optimization problem ([T| is very hard to solve in general. 
Motivated by recent breakthrough of using MCMC to 
solve MaxWeight scheduling in a completely distributed 
fashion, we propose a power and coding-modulation 
update algorithm that emulates a Gibbs sampler over a 
Markov chain with a continuous state space (the power 
level of a transmitter is assumed to be continuously 
adjustable). 

• The algorithm based on the Gibbs sampling may be 
trapped in a local-optimal configuration for an extended 
period of time. To overcome this problem, we exploit 
the technique of simulated annealing to speed up the 
convergence to the optimal power and coding-modulation 
configuration. The convergence of the algorithm under 
annealed Gibbs sampling is proved. From the best of our 

'We use q a b(t) as the link weight so that the algorithm is throughput 
optimal when problem is solved at each time slot. From the best of our knowl- 
edge, this is the first joint power-control and coding-modulation-adaptation 
algorithm. 



knowledge, this is the first algorithm that uses annealed 
Gibbs sampling in a distributed fashion and has provable 
convergence. 

II. System Model 

We consider a wireless network with single-hop traffic 
flows. The network is modeled as a graph Q = (V, £), where 
V is the set of nodes, and £ is the set of directed links. Let 
n = \£ | denote the number of links. We assume that time is 
slotted. Each transmitter a maintains a buffer for each outgoing 
link (ab), if there is a flow over link (ab). The queue length in 
time slot t is denoted by q ab (t). Each transmitter a has limited 
total transmit power p™ ax , and p ab (t) denotes the transmit 
power of link (ab) at time slot t. 

We assume all links have stationary channels, and each 
transmitter a can tune its transmit power continuously from 
to pjf ax , but the number of feasible coding-modulation choices 
is finite. Each coding-modulation associates with a fixed data 
rate, and a minimum SINR requirement. Thus, the data rate 
of a link is a step function of the SINR of the link. The SINR 
of link (ab) is 

„ M Pab(t)gab 

lab(t) = — 7-T , (2) 

n b + 2^(xy)££,(xy)^(ab) VxyKpjQxb 

where n b is the variance of Gaussian background noise expe- 
rienced by node b, and g ab is the channel gain from node a 
to node b. In this paper, all n&s and g a i,s are assumed to be 
fixed. 

Denote by r ab (t) the transmission rate of (ab) at time slot 
t, and A ab (t) the number of bits that arrive at the buffer of 
the transmitter of link (ab) at the end of time slot t. Then, the 
queue length q ab (t) evolves as following: 

q ab (t + 1) = [q a b(t) - r ab (t)] + + A ab (t), (3) 

where [x] + = max{0,a;}. 

Let V C W 1 denote the set of all feasible power configura- 
tions of the network, i.e., 

V = {p: ]T Pab<pT X ,Pab>0}. 
b:(ab)e£ 

For each link (ab), given a power configuration p, the SINR 
of the link j ab is determined by equality Q. The transmission 
rate can be written as ru b \(m a b, p), where m ab is the coding- 
modulation scheme. 

In this paper, we assume a transmitter always selects the 
coding-modulation scheme with the highest rate under the 
given SINR. So m ab is a function of p, and rate r ab can 
be written as a function of p : r ab = r ab (p). Then, we define 
1Z as the set of achievable rate vectors under feasible power 
configurations and modulations, i.e., 

K = {r(p) :peV}. 

The capacity region of the network is the set of all arrival 
rate vectors A for which there exists a power control algorithm 
that can stabilize the network, i.e., keep the queue lengths from 
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growing unboundedly. It is well known that the capacity region 
is Oil: 

A = {\\3fiECo(K),\^ fi}, (4) 

where Co(1Z) is the convex hull of the set of achievable rates 
with feasible power configurations, and -< denotes compo- 
nentwise inequality. A power control and coding-modulation 
adaptation algorithm is said to be throughput optimal if it can 
stabilize the network for all arrival rates in the capacity region 
A. 

It is well-known that if a rate control algorithm can solve 
the following Max Weight problem 1 12 1 for each time slot, then 
the algorithm is throughput optimal. The focus of this paper is 
to develop a power-control algorithm and coding-modulation 
adaptation algorithm to solve the following MaxWeight prob- 
lem: 



maX J2(ab)e£ r ab{p)qab{t) -eE(a6)G£Pa6 

subject to p£P. 



(5) 



Recall that since r (,(p) is a step function of p a b, multiple 
power configurations may result in the maximum weighted 
sum. We therefore added a penalty function — e 53( a fe) P(ab) 
with a small e in the objective function to minimize the 
aggregated power when the weighted sum is maximized. 

III. Algorithm 

We are interested in obtaining the optimal power, coding, 
and modulation configuration that maximizes the weighted- 
sum-rate while minimizing the total transmit power. We can 
solve this problem by constructing a Markov chain whose state 
is the power configuration, and the stationary density satisfies 

7r(p) = — i— - e y (S(ab)E£ r ^(p)Q a b(t)-^T, { a b)< =e P*b) (g) 

Then, letting K — > 0, with probability one, the Markov chain 
is in state p*, the optimal solution to problem (|5j. 

Gibbs sampler is a classical way to construct such a Markov 
Chain with stationary distribution (|6j. Given the current state 
p(i) = p, the Gibbs sampler selects a link, say (ab), in a 
predetermined order and changes the transmit power to p a b 
with probability 



K(Pab\P-ab 



^(Pab,P-ab) 
Jo" n{pabiP-ab)dpab 

where p_ & denotes the vector of transmit powers except that 
of link (ab). It can be verified that the stationary distribution 
of this Markov chain is (|6j. Therefore, if the power vector is 
updated according to this Markov chain, it will converge to 
p* with probability one when K — > 0. 

There are, however, several difficulties in using Gibbs 
sampler for distributed rate control in wireless networks. 

1) First, to compute the conditional distribution, link (ab) 
must know the rate of all the links in the network, which 
incurs significant communication overhead. 

2) Second, the updating sequence of a Gibbs sampler 
is predefined, which results in the need of a central 
controller. 



3) Further, when K is close to zero, 

, I \ ^(Pab,P-ab) 

TT(Pab\P-ab) = p max „ —~ 

J a ^{Pab,P-ab)dpab 

e (12 {l y)e£ r xy(Pab,P-ab)Qxy-e'E,lab)(i£ Pab) / K 

-> 1, 

for p a b such that 

Ixy e / , Pab- 
Pab — — 

(xy)££ (ab)Gf 

In other words, the power configuration may stay in 
a local optimum for a long period of time. This is in 
fact a critical weakness of MCMC methods. We will 
use simulated annealing technique in our algorithm to 
overcome this weakness. 

A. Neighborhood and Virtual Rate 

To overcome the global interference, we note that because of 
channel attenuation, interference caused by a remote transmit- 
ter in general is negligible. We therefore define a neighborhood 
for each node a. We say a node 6 is a one-hop neighbor 
of node a if ma,x{g a b, gba} > a, i- e -> the channel gain is 
above certain threshold. Denote by Af 1 (a) the set of one-hop 
neighbors of node a, where the superscript indicates it is the 
set of one-hop neighbors. We further denote by AT 2 (a) the set 
of two-hop neighbors of node a, i.e., node b belongs to Af 2 (a) 
if b $ Af 1 (a), and b ^ a. and there exists a node c such that 
ceM 1 (a) and be Af^c). 

Then, we define T a b(t) to be 

To&(*) = X] Pxy(t)g x b + n b 

(xj/):xSAA 1 (b) 

which is called noise+partial-interference at node b for link 
(ab). Notice that fib — rib + £ab, where £ a b is an upper 
bound on the interference experienced at node b from the 
non-neighboring transmitters of b. We assume £ a b is known. 
By including this upper bound £ a & in the SINR computation, 
we guarantee that the SINR of link (ab) is a function of 
its neighbors' transmit powers and is independent of non- 
neighbor nodes. This localizes the interference. 

Given the definition of noise+partial-interference, the virtual 
rate of link (ab) is defined to be 



fab(t) = Tab rn a b 



Pabgab 



(7) 



?ab(t) 

We will later discuss how to determine transmit power based 
on actual interference. 

Observe that although the power level is continuous, and 
the SINR of neighboring links are continuous, the virtual 
rate choices are discrete and finite. Suppose (ab) is changing 
its power and link (xy),y € Af 1 (a), is affected. For each 
coding-modulation scheme of link (xy), m € A4 xy , let 
lxy,m denote the SINR requirement of m, and T XVtm denote 
the corresponding partial-noise-plus-interference requirement. 
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Assuming the transmit powers of all other nodes are fixed, the 
power level p ab e {0,p™ a *}, such that T xy (p ab ) = T xy>m , is 
called a critical power, which is highest power node a can 
use for coding-modulation choice m to be feasible over link 

(xy). 

B. Decision Set 

To overcome the issue of predefined update sequence in 
classic Gibbs sampling, we adopt the technique proposed in 
[9 1 to generate a decision set at the beginning of each time 
slot. 

Definition 1. A decision set T> is a set of transmitters such 
that, for any two transmitters a and x in T), x <^ JV 1 (a) U 
TV 2 (a). 

Clearly, two transmitters a and x in the decision set are 
not one-hop or two-hop neighbors. In the proposed algorithm, 
only the links in the decision set are allowed to update their 
transmit powers. By properly generating a decision set, the 
evolution of power configuration is a reversible Markov chain 
with stationary density d5}. 

C. Required Information 

We further assume that node a has the following knowledge: 

• the channel gain from node a to its one-hop neighbors, 
i.e., g ay for all y £ Af 1 (a). 

• for each link whose receiver is a's one-hop neighbor, i.e., 
(xy) :yeM 1 {a): 

- the virtual transmit power of (xy), i.e., p xy (t — 1)0 

- the channel gain of (xy), i.e., g xy . 

- the virtual partial-interference-plus-noise of (xy), 
i.e., f xy (t)f\ 

- the queue length of (xy), i.e., q xy (t). 

- the feasible modulations of (xy), and the minimum 
SINR requirement, T xy m , of each modulation m. 

Notice that p xy (t — l),q xy (t), and T xy (t) change over time. 
We will explain the way node a obtains these values from node 
x in the algorithm. Further we assume all channel gains are 
known to a, and are fixed. The feasible modulations and the 
minimum SINR requirement for each modulation are assumed 
to be known a-priori, and do not need to be exchanged. 

D. Distributed Power Control and Coding Modulation Adap- 
tation Algorithm 

We now present the proposed algorithm, where the evolution 
of power configuration emulates a Gibbs sampler. To improve 
the convergence of the Gibbs sampler, we exploit the technique 
of simulated annealing Q. 

We group every T time slots into a super time slot. In each 
supper time slot, we run the algorithm for T times in the 
background of each node. In t th time slot of a super time 
slot, the value K is set to be K t — i og ^° +t ) , where K t is 
the "temperature" in the terminology of simulated annealing, 

2 the real transmit power is determined by the virtual transmit power 
Calculated based on virtual transmit powers 



and K is a positive constant that can be tuned to control the 
convergence of the proposed algorithm. The idea of the sim- 
ulated annealing is to start with a high temperature (large K) 
under which the Markov chain mixes rapidly. Then by slowly 
decreasing the temperature, the state of the Markov chain will 
converge to the optimal configuration. It has been well-known 
that annealing can significantly reduce the convergence time. 

All nodes maintain virtual power p, and the initial power 
configuration p a b(0) = 0, known by all the nodes in the 
network. The following algorithm describes the process of 
updating virtual power configuration following an annealed 
Gibbs sampler. The real transmit power is then determined 
based on actual SINR. 

At t th time slot of a super time slot, the algorithm works 
as follows: 

(1) Generating decision set: Each time slot consists of 
W control slots at the beginning. A decision set is 
determined at the end of the W control slots. Only 
the transmitters in the decision set update their virtual 
power levels at this time slot. At time slot t, transmitter 
a contends for being in the decision set as follows: 

(i) Node a uniformly and randomly selects an integer 
backoff time T a from [0, W — 1] and wait for T a 
control slots. 

(ii) If a receives an INTENT message from another 
transmitter x such that 

x£M 1 (a)\jM 2 (a) 

before control slot T a + 1, node a will not be 
included in the decision set in this time slot. Here, 
we assume the INTENT message from x has the 
id of x and the signal is strong enough, so that 
x's one-hop and two-hop neighbors, e.g., node a, 
know this INTENT message is sent by x. 

(iii) If node a senses a collision of INTENT messages 
from nodes x, x g Af 1 (a) UA^(a), a will not be 
in the decision set in this time slot. 

(iv) If node a does not receive any INTENT message 
from its one-hop or two-hop neighbors before con- 
trol slot T a + 1, node a will broadcast an INTENT 
message to its one-hop and two-hop neighbors in 
control slot T a + 1. 

i) If the INTENT message from node a collides 
with another INTENT message sent by node 
x £ J\f 1 (a) U Af 2 (a), a will not be in the 
decision set in this time slot. 

ii) If there is no collision, node a will be included 
in the decision set in this time slot. 

We note that W is selected to be large enough so that 
the collision of the INTENT messages happens with low 
probability. 

(2) Link selection: Let d a denote the outgoing degree of 
node a, i.e., d a — \{b : (ab) g £}\. In this step, each 
transmitter a e T) selects an outgoing link (ab) to update 
its virtual power p a b as following: 
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(i) If there was an active outgoing link (ab) such that 
Pab(t— 1) > 0, a will update the power of link (ab) 
in time slot t, with probability j-. 

(ii) If there was no active outgoing link (ab) such that 
Pab(t— 1) > 0, a uniformly randomly selects a link 
(ab) from its d a outgoing links, and then updates 
its virtual power p a &. 

(3) Critical power level computation: Node a computes 
the critical power level p Ctin (ab,xy) as follows: 
(i) Node a computes the critical partial-noise-plus- 
interference of link (xy) corresponding to each 



, m G M xy 



p xy (t -l)g x 

^xy ,m 



(ii) Node a computes the critical power of 
Pc,m(ab,xy), such that when link (ab) uses 
this power level, the resulting partial-noise-plus- 
interference of link (xy) is T xy , m : 

p Ctm (ab,xy) 



- mm < p 



Pab(t-l) + 



^xy(t-l) 



9 ay 



max 



(Some modulations of link (xy) need very high 
SINR, which cannot be achieved even link (ab) 
reduces its power to 0. For these modulations, we 
just let p cm (ab,xy) be zero, we will consider the 
critical power separately in the following step.) 
(4) Virtual rates computation: Now for each node a in 
the decision set T>, it computes the virtual rate of link 
(ab), and the virtual rate of the links whose receiver is 
a's neighbor as following: 

(i) Arrange the critical power levels 

{p c ,m(ab,xy),V(xy) s.t. yeAf 1 (a)} 
in ascending order, denoted by 

=Pc,0 < PcS < ■ <■ 

(ii) Compute the SINR of link (xy), when the power of 
link (ab) is zero: 

(t - l)g xy 

XV ^ X y(t - 1) - Pab(t - l)g a y 

Further, find the coding-modulation of link (xy) 
with the largest transmission rate corresponding to 
this SINR. Let the coding-modulations of all the 
neighboring links of link (ab) be denoted by a 
vector m°. 

(iii) Given this initial coding-modulation vector m°, a 
obtains the coding-modulation vector: 



corresponding to each critical power p c ,i,i 
1,2,---. 



(iv) Obtain the rates fi xy \(p Ct i) related to the coding- 
modulations fn % , xy \ of neighboring links, when 

Pab G \Pc,i,Pc,i+l), i = 0, 1, • • • . 

Note that for each link, the m \ xy ) is the coding- 
modulation scheme with the highest transmission 
rate assuming node a transmits with power p c ^. 

(v) Compute the virtual local weight, under each critical 
power level p c ^ : 

v a b(pc,i) = ^2 '•,„(/'•., )•(/,., 

yeAfHij.ixy^S 

for i = 0, 1, • • • , where q xy is the queue length at 
the beginning of the super time slot. 
(5) Power-level selection: Let Z a \, be the normalization 
factor defined as 



Zab - 



e K t 



e K t 



Node a first selects a power interval \p c ,i>Pc,i+i) with 
following probability: 

1 / g Pc,i «Pc,i + l 

Pr (pab G \p c ,i,p c ,i+i)) = Tr~ e At - e K ' 

^ab \ 

(8) 

Suppose the interval \p c ,k)Pc,k+i) is selected, then node 
a randomly selects a virtual power level p a b(t) according 
to the following probability density function (pdf): 



£ __!£_ / £ Pe,fc 'Pc,fc+1 

fab(p\p G \pc,kiPc,k+l)) = ~j£~ e l< ' 



(9) 

which can be done by using the inverse transform 
sampling method. 

(6) Information exchange: If the virtual power of a node 
has changed, i.e., p a b(t — 1) ^ Pab(t), then node a 
broadcasts p a b(t) to all its one-hop neighbors. Each 
neighbor y computes the virtual partial-interference- 
plus-noise of (xy), i.e., f xy (t). If f xy (t-l) ^ t xy (t), 
node y broadcasts T xy (t) to all its one-hop neighbors. 

(7) Update real transmit power: At the end of a super time 
slot, node a updates its transmit power for link (ab) to 
be p a b = p a b(T). Node b then measures the actual SINR 
7ab and reports to node a. Node a selects the coding- 
modulation scheme m a t with the highest rate among 
those m such that 7( a 6), m < 7afc- Packets of flow (ab) 
are transmitted with power p a i> and coding-modulation 
scheme m a b- Note that the real transmit powers are 
updated only once every T time slots. 

We now present a simple example to show how the algo- 
rithm works. 
Example: 

Consider the wireless network depicted in Fig. [T] There are 4 
links in the network. Assume the channel gain, background 
noises and queue lengths are: g a b = g c d — g e f = 1, 

9cb = g c f = \, rib = n d = n f = 1, and q ab (t) = 10, q cd (t) = 
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100, q e f(t) = 10- The virtual power level of the links in 
the previous time slot are p a b(t — 1) = 15,p c d(t — 1) = 

0, p e f(t — 1) = 10. Further, we assume that there are two 
feasible coding-modulation schemes for each link: BPSK with 
rate 1, and QPSK with rate 2, and the SINR requirement for 
the modulations are 4 and 8, respectively. 

© — <E) 

(s) K^) 

©^-© 

Fig. 1. A simple example 

In this example, we focus on link (cd) . Assume that the set 
of c's one-hop neighbors is A/' 1 (c) = {a, b, d, /}, and the set 
of two-hop neighbors is AT 2 (c) ~ {e, h}. 

Under this neighborhood structure, if c changes its power, 
it will change three links' virtual SINR, and their virtual rates, 

1. e., links (ab), (cd) and (e/). 

In the algorithm, node c randomly select a power level based 
on its interference to the neighboring links with each feasible 
power level. 

(1) Select decision set: Assume that the number of control 
slots is W = 5, and the backoff time generated by the 
transmitters are 



T a = 2,T c = 0,T e = 3,T g 



1. 



(2) 



(3) 



(4) 



Then, c broadcasts an INTENT message at control slot 
1, which is received by a and e. Node g will ignore this 
INTENT message even if it can receive it, because g is 
not within the two-hop range of c. In control slot 2, node 
g broadcasts an INTENT message, which is received 
by node e. Since both transmitters a and e receive the 
INTENT message sent by c, they are not in the decision 
set. And the decision set is {c,g}. 
Remark: Note that c's transmit power affects the virtual 
rate of links (ab), (cd), and (ef), while g's power affects 
the virtual rate of link (gh) only. Thus, no link's virtual 
rate is affected by both of c and g. 
Information exchange: Suppose the power level of link 
(ab) has changed in time slot t — 1, node a then has 
broadcast p a b(t — 1) to all its two-hop neighbors. Node 
c has received this message, which shows that c always 
knows the power level of (ab) and ef. 
Link selection: Each transmitter only has one link, so 
if a transmitter is selected to be in the decision set, its 
outgoing link will be selected. 

Critical power computation: Node c knows the vir- 
tual partial-interference-plus-noise experienced by links 



(ab),(cd),(ef) : 

T a fc(£ - 1) = n b + pcd9cb = 1 

T crf (t - 1) = U c +Pabgad+Pef9ed = 8.5 

f ef (t - 1) = n f +PcdfJcf = 1 

Then, node c can estimate its impact on links (ab) and 
(ef), when it varies transmit power from to p max . 
Fig. [2] illustrates this impact. We can see from the figure 
that there are 5 critical power levels besides and p max ; 
which are 1, 3.5, 6, 11, and 29. Take critical power level 
1 for example, it means that if the power of link (cd) is 
greater than 1, then the virtual SINR of link (ef) will be 
below 8, and link (ef ) will only be able to use BPSK. 



Critical power levels 




Fig. 2. Critical power levels 

(5) Virtual rates computation: Now, c knows the critical 
power levels and the coding-modulation schemes cor- 
responding to each interval between the critical power 
levels. Thus, c can calculate V c d(p c d) accordingly, which 
is shown in Table [I] Given these virtual rates, node c 
then samples a power level according to the distribution 
in equalities ([8]l and (|9j. 

E. Analysis 

In classical Gibbs sampler, the state of each link is updated 
in a sequential manner. In contrast, the Gibbs sampler used in 
our algorithm is parallelized and distributed, which leverages 
the distributed characteristic of wireless networks. In the 
following lemma, we prove that our algorithm generates a 
sequence of power configurations which form a Markov chain 
with some desired stationary density. 

Lemma 1. For a fixed temperature, i.e., without updating 
the temperature in the power control algorithm, and fixed 
queue lengths, the sequence of the power configurations, p(t), 
generated by the power control algorithm, forms a Markov 
chain with the stationary density: 



1 



V {!>)-' T:(qb)ee Pab 



Z(K t 
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TABLE I 

Critical power level and the resulted virtual rate 



Pcd 


[OA) 


[1,3.5) 


[3.5,6) 


[6,11) 


[11,29) 


[29,40J 


(fab(Pcd), fcdiPcd), ?ef(Pcd)) 


(2,0,2) 


(2,0,1) 


(1,0,1) 


(1,0,0) 


(0,0,0) 


(0,1,0) 


V cd {Pcd) 


40 


30 


20 


10 





100 



where Z(K t ) = j peV e 
malization factor. 



V(p)-g E( a b)g£ Pab 



dp is unknown nor- 



Proof: The proof is presented Appendix in [A] ■ 
In our algorithm, the virtual powers are updated using an 
annealed Gibbs sampler. Assuming the queues are fixed, the 
following theorem states that, with fixed queue length, the 
power configurations converge to the optimal solution to |5]) 
as t goes to infinity. Let U(p) = V(p) - e J2( a b)e£ P^b, and 

U* = maxf7(p),Z7* = mmUYp), A = 17* - U, 
per P £V 

Theorem 2. Let Kq = 2nA. Assume q a b are fixed and V*g is 
the set of power configurations such that for any p € Vg, 

J2(ab)e£ Qabfab{p) ~ £ J2(ab)e£ P~ ab 

>(1-S) maxp E( a6) e£ 9af,r a6 (p) - e £ (ab)e£ p ab - 

Then given any S > 0, e > 0, and starting from any initial 
power configuration, po £ P, there is an N G N such that if 



Kn 



0<t<N, 



we have 



K t = \ i _^iri_ N<t ' do. 

[ log(2+JV) JV ^ 1 



lim / P(t, p|0, Po)dp = 1 — e. 

t— >oo /-» 



Proof: The proof is presented in the appendix. The proof 
of the theorem follows the idea in 0. However, in our 
algorithm, the decision set is randomly generated instead of 
predetermined, and the Markov chain has a continuous state 
space instead of a discrete state space, so the convergence 
of the annealed Gibbs sampling is not guaranteed. The proof 
therefore is a nontrivial extension. ■ 

Remark 1: The theorem requires that the queue lengths are 
fixed during the annealing, which is the reason the algorithm 
uses the queue lengths at the beginning of a super time slot 
for the entire super time slot. 

Remark 2: In the algorithm, we replace the interference 
from non-neighbor nodes with upper bound £ . Therefore, when 
node a changes its transmit power to node b to p a b(T) at 
the end of a super time slot, the actual rate r a & is at least 
^ab{T) 1 because the actual interference is smaller than that in 
the virtual rate computation. Further when the neighborhood is 
chosen to be large enough, i.e., £ is small, the optimal configu- 
ration based on virtual rate is close to the optimal configuration 
with global interference. But a large neighborhood increases 
both the computation and communication complexities. 



IV. Simulations 

In this section, we use simulations to evaluate the perfor- 
mance of the proposed algorithm, which is SINR-based, with 
CSMA-based algorithm. In the simulations, we assume the 
channel attenuation over a distance I is l~ a , where the path 
loss exponent is chosen to be 3.5. All channels are assumed to 
be AWGN channels. The transmit power can be continuously 
adjusted from [0,100]. The rate options for each link are 6, 
9, 12, 18, 24, 36, 48, and 54 Mbps, which are the eight rate 
options available in 802.1 lg. The system is time-slotted, and 
each time slot is 1 ms. We assume each packet is of size 1500 
bytes, i.e., 12 Kbits. So when the link rate is 54 Mbps, 4.5 
packets can be transmitted in one time slot. 

A. A Ring Network 

Consider a ring network consisting of 9 directed links, as 
shown in Fig [3] Each node in the network has one transceiver. 
The length of each link is 20 meters. We assume the carrier 
sensing range is 40 meters, which is slightly larger than then 
distance between two nodes that are two-hop away. 




1 




Fig. 3. A ring network containing 9 links 

The arrival process is the same as the one described in |9). 
Namely, at time slot t, one packet arrives at the transmitters 
of links (i mod 9) and ((i + 4) mod 9); additionally, with 
probability p, one packets arrives at each transmitter. Hence, 
the overall arrival rate is 2 + 9p packets per time slot. 
In the simulation, we varied p from 0.10 to 0.26, which 
corresponds to varying the overall arrival rate from 2.9 to 4.34 
(packets/time slot). 

For each value of p, we run each simulation for 10 4 time 
slots. Figure [4] shows the mean of the sum queue length in the 
network. It shows that the sum queue length grows unbounded 
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under the CSMA-based algorithm when p > 0.11 (i.e., overall 
arrival rate > 2.99). In other words, the network is unstable 
under CSMA algorithm for p > 0.11. On the other hand, our 
algorithm stabilizes the network for any p < 0.25, with a 
corresponding overall arrival rate equal to 4.25. Hence, our 
algorithm increases the throughput by 47% comparing to the 
CSMA-based algorithm. 
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Fig. 4. Average queue length in the ring network 

B. A Random Network 

In this simulation, we randomly place 200 links, each with 
length 20 meters, in a 1000 x 1000 meter 2 two-dimensional 
torus. The carrier sensing range is set to be 200 meters, which 
corresponds to a sensing threshold of —91 dBm. We assume 
Poisson arrivals for each link, and the arrival rate is the same 
for all links. 



--H-- Annealed Gibbs 
-A-- CSMA 



.A. 



y 



--Q- 



■m Tj- i 



20 30 40 50 60 70 80 SO 

overall arrival rate 



Fig. 5. Average queue length in the random network 

For each arrival rate, the simulations is run for 10 4 time 
slots. Fig [5] illustrates the time average value of the total 



queue length in the network under arrival rates and differ- 
ent rate-control algorithms. We observe that the supportable 
throughput is 70 packets/slot under the proposed algorithm, 
40 packets/slot under the CSMA algorithm. Our algorithm 
therefore achieves a 75% throughput gain. 

From these simulation results, we observed that our algo- 
rithm significantly outperforms the CSMA-based algorithm, 
which confirms the importance of adapting transmit powers 
and coding-modulation schemes in wireless network to in- 
crease the network throughput. 

V. Conclusion 

In this paper, we developed a distributed power control and 
coding-modulation adaptation algorithm using annealed Gibbs 
sampling, which achieves throughput optimality in an arbitrary 
network topology. The power update policy emulates a Gibbs 
sampler over a Markov chain with a continuous state space. 
Simulated annealing is exploited in the algorithm to speed up 
the convergence of the algorithm to the optimal power and 
coding-modulation configuration. Simulation results demon- 
strated the superior performance of the proposed algorithm. 
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Appendix A 
Proof of LemmaQ] 

We begin the proof with the following lemma, which states 
that our algorithm simulates a time homogeneous Markov 
chain. 

Lemma 3. For fixed temperature and queue lengths, the power 
configurations generated by the power control algorithm form 
a homogeneous Markov chain with state space V . 

Proof: Let p denote the current power configuration, and 
p denote the power configuration generated by the algorithm. 

First, by observing the procedure of the generation of the 
decision set, it is clear that the decision set T> is independent 
of the power configuration p. Moreover, in the link selection 
stage, the links are selected based on T> and p. Thus, T>i 
depends on p only. Second, for each link (ab) e T>i, the 
new power is sampled from the density function which is 
determined by p and p, while is independent of the earlier 
power configurations than p. The claim then follows. ■ 

Knowing that our algorithm simulates a homogeneous 
Markov chain, the following lemma gives us the transition 
kernel density. 

Lemma 4. Suppose link (ab) is selected to update its transmit 
power, then its power is randomly selected according to the 
following density: 

1 1 Vqb(Pab)- € Pab 

g{p a b\(ab) G V h p xy :yeN (a)) = - — r - e 



where 



Z ab {K t ) 



Z ab (K t )= / e dp 
Jo 



is a normalization constant independent of p a b- 

Proof: The proof is presented in Appendix [B] ■ 
It can be easily verified that all the power configurations 
communicate with the zero power configuration, in which the 
transmit power of all transmitters are zero. Also,the zero power 
configuration has a self-loop, which indicates the Markov 
chain is irreducible and aperiodic and thus ergodic. In the 
following two lemmas, we will show that conditioning on any 
link decision set, the detailed balance equations holds, which 
leads to the conclusion of the lemma. 

Lemma 5. Let 

P(p, P )=P(p(i + l)=p|p(t) = p) 
be the transition kernel probability density, and 

P(p, p\Vt) = P(p(t + 1) = P |X>;,p(i) = p). 
For two power configurations p,p£P, 

Pr(B,|p(t) = p) > 0, and P(p,p\V l ) > 0, 

then, 

Pr(2?,|p(t) = p) = Pr(2?,|p(t) - p), and P(p,p|X>i) > 0. 



In other words, if p is reachable from p in one transition 
with a link decision set T>i , then p is reachable from p in one 
transition with the same link decision set. 

Proof: The proof is presented in Appendix [C] ■ 

Lemma 6. For any link in the link decision set X>; , ;/ 

Pr(X>,|p(i)=p) >0, 

then 

7r(p)P(p,p|^)=7r(p)P(p,p|P i )- 

Proof: The proof is presented in Appendix [D] ■ 
Finally, we have 



7r(p)P(p, p) =tt( P ) P (P> PPi) Pr(A |p) 

=7r(p)^P(p,pp,)Pr(^|p) 

v, 

=7r(p)^P(p,pp ; )Pr(^|p) 
=7r(p)P(p,p), 

where the second equality holds by Lemma [6] and the third 
equality holds by Lemma [5] We then concludes the lemma. 

Appendix B 
Proof of Lemma|4] 

Proof: First, we arrange the critical power levels of (ab) , 
{p c (ab,xy),V(xy) s.t. y£jsf 1 (a)} 
in ascending order, denoted by 

= p c $ < p ca < ■■■ < Pc, m = P maX - 

Recall that the power level of link (ab) is generated using 
the following procedure: 

Node a first selects a power interval \p c ,i>Pc,i+i) with 
following probability: 



Pt(p e \p Ct i,p c ,i+ 

where 



■» = f ( 



Pc,i + 1 \ V ab(P<:,i) 



rn— 1 

Zab = (e"^ 
i=0 



e k, _ e K t e k, 



*Pc,i+l \ v ab(Pc,i) 

e K t . 



Given the interval [p c ,i,Pc,i+i) is selected, then a randomly 
select the power level p a b(t) according to the following 
probability density function(pdf): 



E Pc,i+l 



e _ * / £p c,i 
fab\P\P e \pc,i,Pc,i+x)) = -j£ e K * (e K * - 

For any power level p £ [0,p™ ax ], let us consider the 
probability density, it must in an interval [p c ,i,Pc,i+i) f° r some 
i, i S {0, 1, . . . , m — 1}. Hence, p is selected according to the 
following density: 
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fab(p) = fab(p\p € \Pc,i,Pc,i+l)) Pr (P G [Pc,i,Pc,i+l)) 

1 £ Vgb(?c,i) 



Zab K t 



We then need to show when p E [p C) j,p C; j + i), 

^ > ati(Pati)-' ! Pat 



/*• e ** dp 
which is equivalent to show that 



1 £ _£Pab ^tb-c,,) 

e R 't e K t 



e K t dp = Z ab — . 



We simplify the LHS term by computing the integral in 
each interval \p c ,i,Pc,i+i)- 



Pc,i + 1 V ab (p)~ep 

e K * dp. 



Notice that, by the definition of critical power level, the 
modulation of link (ab) and all the links (xy), y G AT 1 (a), 
will not change if p a i, varies between two adjacent critical 
power levels p C i and p c ,i+i- Thus, 

V a b{Pab) = ^2 r xy q xy = V ab (p Cti ) 

(xy):y<£M 1 (a) 

is a constant, when p £ \p c ,i,Pc,i+i)- Hence, 

Pc.«+1 V ab (p)-ep 

e K t dp 

Pc,i 

-e K t I e K t dp 

Jpc,i 
Kt ^b(Pc.i) -ep 

= ^e^^t e -KT\Po,i+i 



itPc.i) / _ Pc.i 



— e K t 



Therefore, we have 



dp 



>o 

m— 1 



. Pc,i + 1 v a6 (p)-.p 

= 2^ / e dp 

i=o 

m— 1 r , i> , , 

EfQ Vab(Pc,0 I _Pa J± Pc,i + 1 

— e K t (e K « -e K * 

e 

The Lemma then follows. 



Appendix C 
Proof of Lemma[5] 

Proof: Suppose 23/ is a link decision set with 
Pr(2?j|p(t) = p) > 0. A power configuration p is reachable 
in one time slot from p, i.e., P(p, p|23;) > 0. Assume that 23 
is the decision set corresponding to Di, we then have 

Pr(23|p(i) - p) > 0. 

Remember that the decision set is generated independently of 
the power configuration, which means 

Pr(2?|p(t) = p) = Pr(23|p(i) = p) > 0. 

Now we consider an arbitrary link (ab) G 23; , which implies 
that a E 23. There are two different cases: 

1) If node a has only one outgoing link, then the event 
(ab) G 23/ and the event a 6 23 are equivalent. Thus, 

Pr((o6) G X>i|p(*) = P) = Pr(a G 2?|p(t) = p) 
= Pr(a G V\p(t) = p) 
= Pr((ab)eX>z|p(t)=p) 



2) If node a has more than one outgoing link, since link 
(ab) G Di when the power configuration is p, then no 
other outgoing link, i.e., (ac) can be active in p. In other 
words, p ac = 0,V(ac) ^ (ab),(ac) G £. Since (ac) is 
not in the link decision set, 



Pac = Pac = 0. 



Hence, 



Pr((o6) G X>j|p(*) = p) = T Pr(a G X>|p(t) = p) 



l 



Pr(a g X>|p(t) = p) 



= Pr((a6) e 2?,|p(t) = p), 

Notice that the equalities hold no matter p a b > or not. 
Another observation is that, given a node a is in 23, the event 
(ab) G 2?/ is independent of the other links (cd) with different 
transmitter. Thus, we have 

Pr(2?,|p(t) = p) 
= Pr(2?,,2?|p(t) = p) 
= Pr(23/|23,p(t) = p) Pr(23|p(t) = p) 

- H Pr((a&)G23,|23,p(t) = p)Pr(23|p(t)=p) 

(ab)GX>( 

= [] Pr((a6)e 2?,|23, p(t)=^)Pr(2?|p(t)=p) 

(ab)£T> t 
= Pr(23/|p(i) = p) 

Next, we show that 

P(p,p|23;)>0 (11) 

Recall that in the power control algorithm, if (ab) G 23;, then 
link (ab) randomly generate a power level according to the 
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distribution in equalities ([8]l and (|9j, which is always positive 
for any candidate power level p £ [0,p™ ax ]. Thus, if (ab) £ 
T>i , and link (ab) can change its power from p a b to p a b with 
positive probability, it then can change its power from p a b 
to p a b. This holds for each link in the link decision set. As a 



Hence, 

7T(p) 
7r(p) 



,[(,V(p)-eY. {ab)eE Po. b )'(V(-p)-eY. lab)e eP^)]/K t 



„E(a6)el3,[(Va6(p)-£Pab)-(Val,(P)-epa b )]/ift 



(12) 



result, we have equality (Hi. The statements in the proposition 
are then proved. ■ 

Appendix D 
Proof of Lemma[6] 

Proof: By the definition of 7r(p) : 

1 



n 

(ab)GX>i 



,[V ab (p)-ep ab ]/Kt 
,[V ab (p)-ep ab ]/Kt' 



7f(p) 



Z(K t ) 



JV(p)-eJ2 (ab)ee .p ab )/K t 



we have 

7r(p) 
7r(p) 



,[(V(p)-eJ2 (ab)e£Pab )~(V(p)~ej: (ab)e£ p al ,)]/K t 



We now show how the weight V(p) is divided into local 
weight according to the link decision set T>i . With a little abuse 
of notation, we define 

^(V^^U^^Kxy) ■.y£AT 1 (a)}. 

For each link (xy) , one of the following two situations must 
be true. 

1) y £ Af 1 (a) for some node a, such that (ab) £ T>i; while 
y ^ A/' 1 (c) for any other node c ^ a, (cd) £ T>i. 

2) y^N 1 ^). 

The first situation is true, because otherwise y £ Af 1 (a) n 
A^ 1 (c), which contradicts Lemma [5] 
Therefore, we have 

V(P) = E f *b(V*)qab 

(ab)££ 

= E 



Since each link (ab) £ T>i in the decision set updates its 
transmit power level independent of the other links in the 
decision set, given the transmit power of the links {(xy) : 
x E Af 1 (a) U7V 2 (a)}, we have 

= II g(Pab\Pxy ■ x G TV 1 (a) U7V 2 (a), (ab) £ Dj) 

(ab)£T>, 

= n 



AV*b(p a b)-ePab]/K t 



(ab)EVi 



Zab(K t ) 



Notice that Z a b(K t ) depends only on the power level of the 
links {(xy) : x £ Af 1 (a) UAf 2 (a)}, and the power level of 
these links are the same in p as in p. We have 



P(p,p|P t ) 
P(P,p\Vi) 



n 

(ab)eX)i 
_7T(p) 



,[V ab (p)-ep ab ]/Kt 
AVab(p)-ep ab }/Kt 



■HI 



fa)* 



a: 2/ 



E 



.ri/ 



(p)&» 



(P)« 



= E E ? *3/(p)fe/ + E f ^(P)<?zy 

= E ^(p)+ E f ^ 

Similarly, 

= E ^ Qb ( p ) + E f xy(p)Qxy 
(ab)EV l (x3/)^ATi(Z>,) 

Note that if (xy) ^ then the virtual rate of (xy) 

remains the same when the power is changed from p to p, 
because none of y's neighbor changes its transmit power. 
Therefore, we obtain 

V( P )-V(p)= J2 Vab(p)~ E Vab(p). 

(ab)GD, (ab)GP, 

Notice that only the links in T>i changes their power level, 
it is obvious that 

E Pab ~ E P ab = E Pab ~ E P ab - 

(ab)e£ (ab)e£ (ab)eV l (ab)eV, 



7T(p)' 

which yields the result in Lemma [6] ■ 

Appendix E 
Proof of Theorem|2] 

In Gibbs sampler, the states of the links are updated in 
sequential and in a deterministic order. This deterministic 
updating scheme makes sure that a full sweep is finished in 
n time slots, which establish a lower bound on the transition 
density between any two power configurations. We then show 
that under our stochastic updating scheme, a similar lower 
bound on the transition density can be obtained. Please notice 
that all the power mentioned in this proof are virtual power, 
we omit the tilde here for simplicity of notation. 

First, let d aM = \{(xy) : x £ Af 1 (a) UA/" 2 (a)}|, which is 
the number of transmitters who contend with a for being in 
the decision set. Recall that d a denotes the outgoing degree 
of node a, i.e., d a = \{b : (ab) £ £}\. Further, let dg = 
max {max( ot)e£ d ab , max aeV d aM } . 

For the simplicity of notations, define 



and 



P{h,P2\h,Pi) = P(p(t 2 ) = p 2 |p(*i) = pi) 



P(t 2 ,P2|*i,7r) = / P(t 2 ,p2\ti,p)ir(p)dp. 
Jv 



Let ll/x— v\\ denote the L 1 distance between two distribu- 
tions on P : — = J v \fi(p) — u(p)\dp. Note that if 
||/i„ — fi\\ — >• 0, then /i„ — > fi as n —> oo, except perhaps on 
a subset of V with Lebesque measure 0. 
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Lemma 7. Define r = \ 2{dg+ ^ log " l, w/zere d = 0.18 is 
a constant. Let F{t) denote the event that each transmitter is 
selected in the decision set at least twice during [t + 1, t + 2r], 
then we have 



Pr(F(i)) > l-2n _1 ,t= 1,2,3,.... 



Proof: First, we show that with stochastic updating, 
all the transmitters are selected in the decision set at least 
twice during [t + l,t + 2r] with high probability, for any 
t = 1,2,3,... 

Let us consider an arbitrary transmitter a. Notice that in 
the decision set generation stage, the decision set is generated 
independent of the state of the transmitter, and is independent 
between different time slots. 



Pr(a e V) = Pr(T a = 0,T X > 0,Vx e TV 1 (a) UAf 2 (a)) 

+ Pr(T a = l,T x > l.Vare Af 1 (a) U Af 2 (a)) + . . . 
+ Pr(T a = dg-l,T x = dg,Vx€ Af 1 (a) Li AT 2 {a)) 
1/1 \ dab 1 / _ 2 \ dab 

dg \ dg J dg \ dg J 

+ — ( — 
dg \dg 

dg-1 1 



dently of the previous ones, we have 

t + T 

Pr(S„(t)) = l- J] (l-Pr(aeX>)) 
fc=t+i 

= 1 - (1-Pr(a G £>)) T 

2(d g + l)log. 



(do + 1) 



2c 1 (dg+l)logr 

> 1 - e (rfg+D-i 



1 - e 



- 2 log n 



Then, let i? a (i) denote the event that a is selected at least 
twice in the decision set during [t+ l,t + 2r]. Obviously, if 
a is in the decision set at least once during [t + 1, t + r], and 
during [t + t + 1, i + 2r], then H a (t) is true, and we have 

Pr(ff (i)) > Pr(S a (t))Pr(S a (t+r)) = (l-n~ 2 ) 2 > l-2n~ 2 

Let F{t) denote the event that every transmitter is selected 
in the decision set at least twice during [t + l,t + 2r], 
i.e.,F(t) = Plaev H a {t)- By using the union bound, we then 
have 



d s -l 

fe=0 
/■dc-l 
> dg^- 1 / 

= ^- 1 -i_ (dg-l) d °> +1 
dab + 1 
^ \ d ab + l 



Pr(F(f))=Pr ( f| H a (t)\ 
\aev ) 

= 1-Pr( |J H a (t)) 

>l-£Pr(jT a (t)) 

> 1 -nsupPr(# (i)) 



> e- 1 1 



>ci 



rfe / d ab + 1 

rf g / d ab + 1 

1 \ 1 



= l-n 1 - inf Pr(£T a (t)) 

> 1 - n ■ 2rT 2 

> 1 - 2n _1 , 

where the second inequality holds because the number of 
transmitters is no more thans the number of links in the 
network, i.e., | V | < n. Thus, we have shown that during a time 
interval of length 2r, where r = 0(dg logn), with probability 
greater than 1 — 2n~ 1 , each transmitter is selected in the 
decision set at least twice. ■ 
Next, we will show that given F(t), P(t+2r, p\t, p) > S(t) 
for any p,pe?. 



1 

dg + l 



(13) 



Lemma 8. 



inf P(t + 2T,p\t,p) > 6(t) 
P,f>ev 



constant. 



where a = 0.18 < e~ x \ < er 1 (l - j^j is a 

Let S a (t) denote the event that transmitter a is selected at 
least once in the decision set during time interval [f+l,t + r]. 
Since the decision set is generated at each time slot indepen- 



for any t = 1,2, ... , where S(t) = c 2 e" 2 ™ A / Kt + 2T , and c 2 = 
(1 - 2n- 1 )(p™ ax dg)- 2n . 

Proof: 

Notice that in the power control algorithm, not every link 
whose transmitter is in the decision set can update its power. 
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We define T>\ to be the set of links who can update their 
transmit powers, called the link decision set. The intuition of 
this Lemma is that, given F(t), each link has the chance to 
update its power, and the transition density between any two 
power levels of the link is lower bounded. First, if a has only 
one outgoing link (ab), then (ab) must have the chance to 
update its power when a is in the decision set. On the other 
hand, if a has more than one outgoing link, and assume p ac > 
at time t, p(ab) > at time t + 2r. Since it is in the decision 
set twice, it can turn off (ac) at the first time it is in I?, then it 
can change the power of link (ab) at the second time that it is 
in T>. Let us then derive the lower bound on P(t + 2r, p\t, p). 

For a transmitter a that has only one outgoing link (ab), 
assume that the last time a e D is at time slot t a . We then 
know (ab) € T>i at time t a , and 

P( Pab (t + 2t) = p ab \p(t a - 1), (ab) G T>i) 

=P(Pab(ta) = Pab\p(ta ~ 1), (06) G V X ) 
- 1 JVab(Pub)-e(Pab)}/K t 



>- 



Zab(K t ) 

e lV ab (Pab)-e(Pab)]/K t 



> 



(U.-U*)/K ta 



>[0,K 
1 



> 



jjmax 
1 

( 

Pa 



-A/K t+2T 



> e -2A/JC 4+ar ^aax d{; j-a < 

For transmitter a that has more than one outgoing link, assume 
that the last two times a G T> is at time slots t a \ and t Q 2, 
respectively. Notice that only one of outgoing links of a can 
be active at each time slot. Suppose at time t a t, link (ac) is 
active, and in p, at time t + 2r, link (ab) is active. 

P( Pab (t + 2T) =p ab \p(t) =p) 
= P(Pab(ta2) = Pab\p(ta2 ~ 1), (ab) G T>l) 

P(Pac(t a i) = 0|p(*oi - 1), (ac) G P;) 
> J_ e -A/Jf« al ri± e -A/K lo2 d -i 

— ijinax a ^max « 

.r'a .Fa 

> e -2A/if t+2T ^max rfe )-2^ 

Clearly, if none of the outgoing link of a is active at time t a \, 
then 

P(p ab (t + 2r) =p ab \p(t) =p) 

= P(Pab(ta2) = Pab\p(ta2 - 1), (ab) G Z>j) 
1 



— , n max u 
ra 

> e -2A/if t+2T ^max rfe 



Thus, given each transmitter is in the decision at least twice 
during [t + l,t + 2r]„ i.e., given F(t), we have 

P(t + 2r, p|t, P ) > J] e- 2A/Kt+2T br x ^)^ 2 
aev 

> e -2nA/K t+2T ^max de )-2r^ 



Together with Pr(F(t)) > 1 — 2n 1 , we have 
P(i + 2r, p|£, p) > (1 - 2n- 1 )e- 2nA/Xt +^ (pT* d g)~ 2n - 

Let c 2 = (l-2n- 1 )(p™ ax d g )- 2 ",and(5(t) = c 2 e- 2nA / Kt + 2 - , 
the lemma then follows. ■ 



Lemma 9. Lef 



A'n 



K t = 



logj2+t 



log(2+JV) 



< t < N, 
N <t, 



(14) 



where N £ N is a fixed integer. Then, for every to = 
0,1,2,..., 

lim sup \P(t, p t \t , po) - P(t,pt\t ,po)\dp t = 0. 

Proof: Define T k = t + 2kr, k = 1,2,.... First, we 
consider a fixed p t . 

P(t, pt\to, po) = J P P(t, p t \T x , p)P(Ti, p|t , Po)dp- 

Let be the set of power configurations that minimize 
P(t,p t |ri,p), i.e., 

V L = argminP(t,p t |Ti,p), 

and p* be an arbitrary element in Vl- Further, for given 
e > 0, we define a small neighborhood around the power 
configurations in Vl '■ 

P L ,s = {p-- \P(t,p t \T 1 ,p)-P(t,p t \T 1 ,p*)\ <e}. 

Similarly, we define 

Vu = argmaxP(i,p t |Ti,p), 
pev 

and p* be an arbitrary element in Vu- Also, we define 

Tu, e ^ {p : \P(t,p t \T u p) ~ P(t lPt \T llP *)\ < e}. 

Next, we derive an upper bound and an lower bound on 
P(t,p t \t ,Po) for different p . 

P(t,p t \t ,po) 
= J v P(t, p t \T 1 ,p)P(T 1 ,p\t , Po )dp 
= J Vus P(t,p t \T 1 ,p)P(T 1 ,p\t ,p )dp 

+ J PLs P(t,p t \T 1 ,p)P(T 1 ,p\t ,p )dp 

+ /p\(^, E uP i , e ) F (^P*l :r i'P) P ( T i'Pl*0'Po) d P 
Since J v P(T\, p|t , Po)dp = 1, and by Lemma [8] we have 

p(t x , p\t , po) ><y(to). 

Hence, the maximum value of P(t, Pt|toj Po) is obtained by 
letting P(r 1)P |i ,Po) = S(t Q ) for all p £ V v . e . Thus, 

L P(T llP \t 0l p ) <l-a(P\V Ute )8(to), 
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where a is used to denoted the Lebesque measure of a set, in 
order to distinguish with the modulation function. Hence, we 
have, 

P(t,p t \t ,Po) 
<P(t, p t \T t , p*) J Pue P(T, ,p\t , p )dp 

+ J Vl e P(t, Pt \T 1 ,p)P(T 1 ,p\to, Po)dp 

+ J VVVucUVLE) P(t,Pt\Ti,p)P(T 1 ,p\t ,p )dp 

<P(t, pt|Ti,p*)il - a(V \ Vu, s )8(t )] 

+(P(t,p t \T 1 ,p,) + e)a(V L ,e)S(t Q ) 

+ s ( t o) U\{Vu,eur L , e ) P (*> P* l Tl > P) d P' 

Similarly, we can obtain a lower bound on P(t, pt\to, Po) as 
well: 

P(t, p t \t ,Po) >P(t, Pt\Tx, p*)[l - <r(P \ p£, e )£(*o)] 
+ (P(t, p t \T 1>P *) - e)a(Vu, e )6(t ) 

+ S (to) Iv\(Vu,^V L , s ) P (*' P* l Tl ' P) d P' 

Hence, by taking the difference between the upper bound and 
lower bounded derived above, we have that for any p t , po , 
and p , 

|P(t,p t |i ,po) - P(t,p t |to,Po)| 

<|P(t > P*|ri > p*)-P(*,p t |r 1 ,p»)|[i- £ r(p)«(t )] 

+ ed(t )(a(Vu, £ ) + a(V L , £ )) 

<(P(t,Pt|Ti, P *) - P(t,p t |T 1)P *))[l - <7(P)5(t )] 

= sup \P(t, p t |Ti , P ) - P(i, pt |Ti , P ) | [1 - <7(P)*(to)] , 
p,p 

where the last inequality holds by the definition of p* and p* , 
and also because e is arbitrarily small. Since the inequality 
holds for any Po , Po , we have 

sup |P(t,p t |* ,Po) - P(*,Pt|*o,Po)| 

Po.Po 

< sup \P(t, ptlTi.p) - P(t, p t \T u p)|[l - a(V)S(t )}. 
p>p 

Let k = L^f 2 "] i sucn m at T/- < t is the nearest time point to 
£ which is of the form tn + 2ir. Notice that K t > ,_ 2 "^ , and 



S(t) = C2e -2«A/A' t+2 
Hence, 



Lq-tah . i>uui<c uiiii i\ t ~_ log ( 2+t ) ■ 

we have 5(t + 2ir) > s - 



2+t +2(i+l)T- 



sup |P(*,p t |t ,Po) - P(*,Pt|*0)Po) 
Po,Po 

<sup|P(t,p t |T fe ,p)-P(t,p t |T fe) p)| 



pp 

fc-i 



Y[[l-(T(V)5(to+2iT)} 



i=0 

<sup\P(t,p t \T k ,p) - P(t,p t \T k ,i>)\ 
pp 

fc-l r 



n 

t=0 



1 - 



C2<J{V) 



2 + t Q + 2(i + l)T 



By Lemma |4j it can be easily verified that \P(t,pt\T).,p) — 
P(t, pt|2fc, p)| is bounded for each p f ,p, p, and t. Further, 
since limfc_ J . 00 J2i=o(^o + 2(i + l)r + 2) = oo, we have 



fc-i r 



lim I 
k— J- J- 



1 - a(P) 



C2 



2 + < + 2(i + l)r 



= 0. 



Note that k — » oo as i — > oo, we then obtain 

lim sup |P(t,Pi|£o, P o) -P(t,Pt|*o,Po)| = 0, 

i->oo p t . po . po 

which concludes the lemma. ■ 

Lemma 10. For any given e > 0, there exists N € N swcn 
f/iaf /or any t > to > N,t* > N, we have 

\\P(t,-\to,7TK t ,)-7r K J\ <e. 

Proof: 

First, we claim that for any t > to > 0, and starting with 
any distribution /i, 

\\P(t,-\t ,fi)-n Kt \\ < ||P(t-l,.|to,M)-7rjc t ||. 



\\P(t,-\to,n)-ir Kt \\ 
= f v |P(*,p|*o,M) ~ ^K t (p)\dp 
= I v \Sv-d x p (*)(PfpP-»i)l*- 1 dPv l ,P-v> l ) 

P(t - 1, (px^p-xOlfo,/*) 

- 7TA' t ((pX) ! ,P--D ! ))P(^ (pZ>i,P-X>i)l* - 1) (pX>,,P-X> i ))^PX> 1 |dp 
<fvfv Vl P (^(PT> V P-Vi)\t- l^pppP-K,)) 

|P(i- 1, (pz>,,P-z>i)l*o,M) " Tjr t ((p©i,P-2> l ))|d Px >i# 

- Tift ((Pz>, , P-p )) \dpvt dp-v, 

U Vi P(t,(pD,,P-Vi)\t-l,(pT>,,P-Vi))dpT3 t 

= U-„ l I^C* — x ' (Px> z ,P-z? t )!io,/^) 

= Sv l P ( t_ i^PX-uP-X-Jltc/i) -^ t ((PD,,P-X>j))MP 

=I|P(*- i,-|*o,m) — tic*I| 5 

where the second equality holds by Lemma [T] i.e., 

1"K t (p) = 

/ P5Jj TK t ((PZJ, , P-V,))P(t, (Pv, , P-X>, )|* - 1, ( P Z>, , P-X> ; ))dpT>, ■ 

Observe that as t — > oo, irK t will have higher probability in 
each optimal power configuration. It can be shown that there 
exists an N 6 N large enough, such that for t > N, irK t (p) 
is strictly increasing in t, for each p <= V*; while 7r^ t (p) is 
strictly decreasing in t, for each p ^ V* . Thus, we have 
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We then have that, starting with distribution TTK t * at ti me 

to, 

\\P(t,-\t ,-K Kt „)-TT Kt ,\\ 

<\\P(t,-\t ,ir Kt ,) -TT Kt \\ + \\jr Kt -ir Kt ,\\ 
<\\P{t - l,-\t , ir Kt ,)- n Kt \\ + \\ir Kt - n Kt ,\\ 

<\\P(t-l r \to,ir Kt ,)-irK t ^\\ 

+ IKkj-! - "if* || + ~ KK t , || 



<\\P(t O ,-\t0^K t ,)-7T Kto \\ 
t—1 

+ X! 1 1 71 "-^™ ~~ n K m+1 1 1 + \\^K m - 7TK t . 1 1 
m—to 

The first term, ||P(to, -\to, ^K t , ) — 7T K tQ \\ = 0, because we 
assume that the process starts with distribution ^K ta = ^K t * ■ 
Since Y^tLi \ \ n K t — 7TK" t+ i|| < oo, we have that, for given 
e > 0, there exist N € N, such that for t > t > N,t* > N, 



t-i 

IK*- 

m— io 



< 



2' 



and 



max{f * ,t}— 1 

Kif t - 7TK t , || < IK^ 



m— mm 



^ m+1 |l<2" 



The lemma then follows. ■ 
Finally, we show how to establish Theorem [2] by Lemma [9] 
and Lemma [10] 

||P(V|0,p)-7rK t «|| 
= \\Iv '1*0' P) P ( f o, P|0, p)dp - 7r Kt , || 
< \\U P(t, -K P)P(to, p|0, p)# - P(i, .|t , ** t . ) || 
+ ||P(t,-|*o,7rjc t .)-7rjc t .|| (15) 

By Lemma [TO] for given e > 0, there exists Ni € N, such 
that if t > t > Nx, t* > Nx, 



\\P{t,-\to,n Kt ,)-Tr Kt ,\\ < 



(16) 



Consider the first term, 



|| Jp P(t, -|*o, P)P(t0, P|0, p)# ~ P(t, '1*0, 7TK t , ) || 

= ||/ 7 ,P(t,-|i O) p)-P(to,p|0,p)dp 

- /p-P(^-|*o,p)7rjr t .(p)dp|| 
- ||/p PC*, -1*0, P)(P(to, P|0, p) - TTjc,. (p)) || 
= /p I/p^P^o-pIWo.pIO,?) - 7rjf t . (p))dp| dp 

= r p |/ p (P(t, pjt Q , P ) - P(t, P \t ,p)) 

{P(t Q ,p\0,p) - 7T Kt ,(p))dp\dp 

< U I v \(P(t, pM) - P(t, p\to, p))| 

\(P(t , p\0,p) - n Ktt (p))\dpdp 



By Lemma |9| for any given e > 0, there exists N e N, 
such that for t > N. 



\(P(t,p\t ,p) - P(t,p\t ,p))\ < 



Thus, 



\\j v P(t, -\t , P)P(t , p|0, p)dp - P(t, -\t , n Kt . ) || 

< Iv Iv K p (*> Pl*o. P) - P^ Pl*d. P))l 

|(P(t ,p|0,p)-7r^(p))|dpdp 
-Ivlv 8n a / v pr I (•f'^o, P|0, p) - 7TRr t . (p))| dpdp 

£ 

<-. 
-4 



(17) 



Hence, by equalities (15 1,(16 1 and (17i. 



||P(t,-|0,p)-7r K J| < 



Notice that for given e > 0, there exists N2 € N such that 

for t* > AT 2 , we have 



Jp, TTK t , (p)dp > 1 - e/2. 



(18) 



By selecting N = max{iVi, N2}, inequalities (I61 and (18 1 
hold, which imply that 



Jp,P(t,p|0,p)# 

> Sv; (P) - P' ' P) - n *r (P)l rf P 
= Iv s n K t , (p)rfp - f P . \P(t, p|0, p) - TT Kt , (p)\dp 
e e 

>l-s. 



We then conclude the theorem. 
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Appendix F 
Notations 

Q = (V, £) The graph model of the network. 

V The set of the nodes. 
£ The set of links. 

A/" 1 (a) The set of one-hop neighbors of node a 

J\f 2 (a) The set of two-hop neighbors of node a 

V The decision set, the nodes in which can 
update their transmit powers. 

n The number of links in the network, i.e., 

\£\. 

q a b The queue length of link (ab). 

p ab Transmit power allocated to link (ab). 

V The power configuration space. 

g a b The channel gain from the node a to node 

b. 

r a b The capacity of link (ab) . 

-f ah The SINR of link (ab). 

T a b The partial-noise-plus-interference experi- 

enced by link (ab). 

M a b The set of feasible coding-modulation 

schemes of link (ab). 

V(p) The virtual weight of the network, i.e., 

E(a6)e£ f ab(p)ga6- 

V a b(Pab) The virtual local weight at link (ab), i.e., 

jxy.m The SINR requirement of link (xy) corre- 

sponding to coding-modulation scheme m. 



Pc,m(ab, xy) The critical power of link (ab). When (ab) 
uses this power level, the SINR of link 

(xy) is ^jxy^m- 



